add disaster-recovery component by filariow · Pull Request #10686 · redhat-appstudio/infra-deployments

filariow · 2026-02-27T17:17:43Z

add disaster-recovery to the development overlay
add eventlistener and cronjob

This only affects the development overlay

Signed-off-by: Francesco Ilario <filario@redhat.com>
rh-pre-commit.version: 2.3.2
rh-pre-commit.check-secrets: ENABLED

Use Tekton's EventListener and Trigger plus a CronJob to execute a pipeline every hour; cf. https://github.com/tektoncd/triggers/tree/main/examples/v1beta1/cron
Signed-off-by: Francesco Ilario <filario@redhat.com>
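For readers unfamiliar with the pattern referenced above, here is a minimal sketch of how a CronJob can drive a Tekton pipeline through an EventListener, loosely following the upstream cron example. Only the CronJob name, namespace, and schedule come from this PR; the EventListener name, TriggerTemplate name, service account, pipeline name, and curl image are assumptions made for illustration and may not match the actual manifests.

```yaml
# Hypothetical names throughout, except where noted as coming from the PR.
apiVersion: triggers.tekton.dev/v1beta1
kind: TriggerTemplate
metadata:
  name: disaster-recovery-template        # assumption
  namespace: konflux-disaster-recovery    # from the PR
spec:
  resourcetemplates:
    - apiVersion: tekton.dev/v1
      kind: PipelineRun
      metadata:
        generateName: disaster-recovery-
      spec:
        pipelineRef:
          name: disaster-recovery          # assumption: the pipeline to run
---
apiVersion: triggers.tekton.dev/v1beta1
kind: EventListener
metadata:
  name: disaster-recovery                  # assumption
  namespace: konflux-disaster-recovery
spec:
  serviceAccountName: disaster-recovery-triggers   # assumption
  triggers:
    - name: cron
      template:
        ref: disaster-recovery-template
---
apiVersion: batch/v1
kind: CronJob
metadata:
  name: run-disaster-recovery-pipelinerun  # from the PR
  namespace: konflux-disaster-recovery     # from the PR
spec:
  schedule: "0 * * * *"                    # every hour, from the PR
  jobTemplate:
    spec:
      template:
        spec:
          restartPolicy: Never
          containers:
            - name: trigger
              image: curlimages/curl       # assumption, as in the upstream example
              # Tekton Triggers exposes an EventListener as a Service named
              # el-<eventlistener-name>, listening on port 8080.
              args:
                - curl
                - -X
                - POST
                - --data
                - "{}"
                - http://el-disaster-recovery.konflux-disaster-recovery.svc.cluster.local:8080
```

The CronJob only sends an empty POST; the EventListener turns that request into a new PipelineRun via the TriggerTemplate.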

openshift-ci · 2026-02-27T17:17:47Z

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

openshift-ci · 2026-02-27T17:17:50Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: filariow
Once this PR has been reviewed and has the lgtm label, please assign simonbaird for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

meyrevived · 2026-03-01T20:14:41Z

Hey @filariow, so adding disaster recovery to the development overlay is for the e2e-tests. In the e2e-tests, the backup and recovery are all done programmatically through Ginkgo code. It just needs the infrastructure to be available (MinIO + OADP sitting there, ready), not an automatic DR action.
The backup action has its own ApplicationSet, with cluster label selectors here. This PR does things in a completely different model. Could you explain more about why this is how you proposed to do things?

What e2e-tests needs for the DR effort is just a development/ dir here with MinIO + OADP manifests, plus ensuring that the existing backup ApplicationSet routes dev clusters to that overlay.
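To make the "ApplicationSet with cluster label selectors" model concrete, here is a rough sketch of an Argo CD ApplicationSet that uses the cluster generator to route labeled clusters to an environment overlay. This is not the repository's actual backup ApplicationSet: the resource name, label key, project, component path, and destination namespace are all assumptions made for illustration.

```yaml
apiVersion: argoproj.io/v1alpha1
kind: ApplicationSet
metadata:
  name: backup-example                     # hypothetical
spec:
  generators:
    - clusters:
        # Only clusters whose Argo CD cluster secret carries this label
        # are selected. The label key and value are assumptions.
        selector:
          matchLabels:
            example.com/environment: development
        values:
          environment: development
  template:
    metadata:
      name: backup-{{nameNormalized}}
    spec:
      project: default                     # assumption
      source:
        repoURL: https://github.com/redhat-appstudio/infra-deployments.git
        targetRevision: main
        # Hypothetical component path; the overlay directory is picked
        # from the generator's values.
        path: components/backup/{{values.environment}}
      destination:
        server: '{{server}}'
        namespace: openshift-adp           # assumption
```

With this shape, adding a development/ overlay directory and labeling dev clusters accordingly would be enough for them to receive the MinIO + OADP manifests.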

filariow · 2026-03-02T09:45:43Z

> Hey @filariow, so adding disaster recovery to the development overlay is for the e2e-tests. In the e2e-tests, the backup and recovery are all done programmatically through Ginkgo code. It just needs the infrastructure to be available (MinIO + OADP sitting there, ready), not an automatic DR action.

What this PR adds is not meant to be used in the development overlay; it's meant for staging. In development we just want to test that changes to its manifests are sound before they are promoted to staging. In the e2e-tests executed in the development overlay we can also suspend the cronjob. In staging it will be used to execute the test periodically.

> The backup action has its own ApplicationSet, with cluster label selectors here. This PR does things in a completely different model. Could you explain more about why this is how you proposed to do things?

That's because, for the scope of this PR, we want to target dev only, not all labeled clusters.

github-actions · 2026-03-02T09:57:50Z

Kustomize Render Diff

Comparing 565bdcd16 → 2320015cc

| Component | Environment | Changes |
|---|---|---|
| `components/disaster-recovery/development` | development | +164 -0 |
| `components/etcd-shield/production` | production | build error |
| `components/etcd-shield/production/kflux-fedora-01` | production | +1 -1 |
| `components/etcd-shield/production/kflux-ocp-p01` | production | +1 -1 |
| `components/etcd-shield/production/kflux-osp-p01` | production | +1 -1 |
| `components/etcd-shield/production/kflux-prd-rh02` | production | +1 -1 |
| `components/etcd-shield/production/kflux-prd-rh03` | production | +1 -1 |
| `components/etcd-shield/production/kflux-rhel-p01` | production | +1 -1 |
| `components/etcd-shield/production/stone-prd-rh01` | production | +1 -1 |
| `components/etcd-shield/production/stone-prod-p01` | production | +1 -1 |
| `components/etcd-shield/production/stone-prod-p02` | production | +1 -1 |
| `components/monitoring/prometheus/production/kflux-fedora-01` | production | +614 -0 |
| `components/disaster-recovery//empty-base` | staging | build error |
| `components/external-secrets-operator/staging` | staging | +3 -3 |
| `components/monitoring/prometheus/staging/base` | staging | +1 -2 |
| `components/monitoring/prometheus/staging/kflux-stg-es01` | staging | +1 -2 |
| `components/monitoring/prometheus/staging/stone-stage-p01` | staging | +1 -2 |
| `components/monitoring/prometheus/staging/stone-stg-rh01` | staging | +1 -2 |
| `components/multi-platform-controller/staging` | staging | +2 -9 |
| `components/multi-platform-controller/staging-downstream` | staging | +2 -9 |

Total: 20 components, +798 -38 lines

📋 Full diff available in the workflow summary and as a downloadable artifact.

meyrevived · 2026-03-02T10:54:07Z

@filariow

> In development we just want to test that changes to its manifests are sound before they are promoted to staging.

By what test? The test suites for e2e-tests might be disrupted by the cron job being triggered every hour, and it will definitely need to be suspended in the ITSes planned. Do you plan on something else?

filariow · 2026-03-02T11:07:24Z

> @filariow
>
> > In development we just want to test that changes to its manifests are sound before they are promoted to staging.
>
> By what test? The test suites for e2e-tests might be disrupted by the cron job being triggered every hour, and it will definitely need to be suspended in the ITSes planned. Do you plan on something

Just by the fact that ArgoCD can install them successfully and proceed to run the e2e-tests. In the development overlay we can patch the cronjob so that it does not execute (spec.suspend: true) or patch the pipeline to run a no-op.
This way the e2e tests running on every PR won't be impacted and, before we promote the manifests to staging, we'll have validated that the changes are sound and that ArgoCD managed to apply them in the development environment.
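As an illustration of the suspension idea, a development overlay could carry a Kustomize patch along these lines. This is only a sketch: the `../base` resource path is an assumption about how the manifests might be laid out, while the CronJob name comes from this PR. Note that the field in the `batch/v1` CronJob API is `spec.suspend`.

```yaml
# kustomization.yaml for a hypothetical development overlay
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization
resources:
  - ../base                                # assumption: shared base manifests
patches:
  - target:
      group: batch
      version: v1
      kind: CronJob
      name: run-disaster-recovery-pipelinerun   # from the PR
    # JSON 6902 patch: keep the CronJob installed but stop it from
    # creating Jobs, so hourly runs cannot interfere with e2e tests.
    patch: |-
      - op: add
        path: /spec/suspend
        value: true
```

ArgoCD would still apply and validate the full set of manifests, but no Job would be scheduled while `suspend` is true.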

eisraeli · 2026-03-02T11:27:04Z

@filariow
Given that we currently maintain the backup ArgoCD application within our disaster recovery scope, would it be worth considering a consolidation of these two components?

eisraeli · 2026-03-02T11:35:06Z

components/disaster-recovery/development/cronjob.yaml

+  name: run-disaster-recovery-pipelinerun
+  namespace: konflux-disaster-recovery
+spec:
+  schedule: "0 * * * *"  # every hour


@filariow Don't we want to run it daily? Every hour is too frequent.

Yeah, I agree it could be too frequent for the real use case. However, right now this targets the development overlay only and it executes a dummy pipeline. Let's agree on the right schedule when we target staging.
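For reference when that discussion happens, the difference between the two schedules is a single cron field. The fragment below is a hypothetical strategic-merge patch for a later staging overlay; the daily value is only the alternative suggested in the review, not something agreed in this thread.

```yaml
apiVersion: batch/v1
kind: CronJob
metadata:
  name: run-disaster-recovery-pipelinerun
  namespace: konflux-disaster-recovery
spec:
  # "0 * * * *" (this PR) fires at minute 0 of every hour.
  # "0 0 * * *" fires once a day, at 00:00 in the controller's time zone.
  schedule: "0 0 * * *"
```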

filariow added 2 commits February 27, 2026 13:20

add disaster-recovery to the development overlay

6d58ee9

Signed-off-by: Francesco Ilario <filario@redhat.com>
rh-pre-commit.version: 2.3.2
rh-pre-commit.check-secrets: ENABLED

add eventlistener and cronjob

d129409

Use Tekton's EventListener and Trigger plus a CronJob to execute a pipeline every hour; cf. https://github.com/tektoncd/triggers/tree/main/examples/v1beta1/cron
Signed-off-by: Francesco Ilario <filario@redhat.com>

openshift-ci bot added the do-not-merge/work-in-progress label Feb 27, 2026

filariow requested review from eisraeli and meyrevived February 27, 2026 17:17

github-actions bot added environment/development environment/staging labels Feb 27, 2026

fix linter

aac9b87

Signed-off-by: Francesco Ilario <filario@redhat.com>
rh-pre-commit.version: 2.3.2
rh-pre-commit.check-secrets: ENABLED

fix linter

2320015

Signed-off-by: Francesco Ilario <filario@redhat.com>
rh-pre-commit.version: 2.3.2
rh-pre-commit.check-secrets: ENABLED

eisraeli reviewed Mar 2, 2026

filariow wants to merge 4 commits into redhat-appstudio:main from filariow:add-dr
